Harvesting Related Entities with a Search Engine
نویسندگان
چکیده
This paper addresses the problem of related entity extraction and focuses on extracting related persons as a case study. The proposed method builds on a search engine. Specifically, we mine candidate related persons for a query person q using q’s search results and the query logs containing q. The acquired candidates are then automatically rated and ranked using a SVM regression model that investigates multiple features. Experimental results on a set of 200 randomly sampled query persons show that the precision of the extracted top-1, 5, and 10 related persons exceeds 91%, 90%, and 84%, respectively, which significantly outperforms a state-ofthe-art baseline.
منابع مشابه
Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore
Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...
متن کاملExplaining relationships between entities
Modern search engines are increasingly aiming to understand users’ intent in order to answer information needs more effectively by providing richer information than the traditional “ten blue links”. This information might include context about the entities present in the query, direct answers to questions that concern entities and more. A recent trend when answering queries that refer to a sing...
متن کاملIdentifying the Names of Complex Search Tasks with Task-Related Entities
Conventional search engines usually consider a search query corresponding only to a simple task. Nevertheless, due to the explosive growth of web usage in recent years, more and more queries are driven by complex tasks. A complex task may consist of multiple sub-tasks. To accomplish a complex task, users may need to obtain information of various task-related entities corresponding to the sub-ta...
متن کاملMeasuring the Weight of Relations Between Entities
Extracting relations among entities is an active research area of Semantic Web studies related to semantic research and information inference. Although many studies have proposed extraction of large-scale relational data, how to weight each relation has not been well studied. Intuitively, a relation between two entities might be more important than relations between other entities. Therefore, t...
متن کاملEntity-oriented Search Engine Result Pages
Modern search engine result pages often contain a mixture of results from structured and unstructured sources. Where such mixtures of structured and unstructured information are called for, the state-of-the-art is to organize complex search engine result pages around entities. Generating such a mixture of entity-oriented results in response to a traditional keyword query raises a number of inte...
متن کامل